Overview


Dataset statistics

Number of variables: 10
Number of observations: 3276
Missing cells: 1434
Missing cells (%): 4.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 256.1 KiB
Average record size in memory: 80.0 B
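
The missing-cell percentage can be checked from the counts in this report. The following is a minimal sketch of that arithmetic, assuming the percentage is missing cells divided by the total number of cells (rows times columns); the variable names are illustrative and not taken from the report.

```python
# Figures copied from the dataset statistics and the alerts in this report.
n_variables = 10
n_observations = 3276
missing_per_column = {"ph": 491, "Sulfate": 781, "Trihalomethanes": 162}

# Assumption: missing-cell share = missing cells / (rows * columns).
total_cells = n_variables * n_observations
missing_cells = sum(missing_per_column.values())
missing_pct = 100 * missing_cells / total_cells

print(missing_cells)           # reported Missing cells: 1434
print(f"{missing_pct:.1f}%")   # reported Missing cells (%): 4.4%
```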

Variable types

Numeric: 9
Categorical: 1

Alerts

ph has 491 (15.0%) missing values Missing
Sulfate has 781 (23.8%) missing values Missing
Trihalomethanes has 162 (4.9%) missing values Missing
Hardness has unique values Unique
Solids has unique values Unique
Chloramines has unique values Unique
Conductivity has unique values Unique
Organic_carbon has unique values Unique
Turbidity has unique values Unique

Reproduction

Analysis started: 2024-12-14 16:50:49.592841
Analysis finished: 2024-12-14 16:50:55.847027
Duration: 6.25 seconds
Software version: ydata-profiling v4.12.1
Download configuration: config.json
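
A report of this kind is typically regenerated with a few lines of Python. The sketch below assumes the data sit in a CSV file; the function name, file names, and report title are hypothetical, since the report does not state the source path or the configuration used. It relies only on `pandas.read_csv`, `ProfileReport(df, title=...)`, and `ProfileReport.to_file`.

```python
def build_report(csv_path: str, html_path: str) -> None:
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so this module loads even without the third-party packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    # The title is a placeholder; the original report's title is not known.
    report = ProfileReport(df, title="Water quality profile")
    report.to_file(html_path)


# Example call with hypothetical file names:
# build_report("water_potability.csv", "report.html")
```

Default settings are used here; the exact options behind the original run would be in the config.json mentioned above.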

Variables

ph
Real number (ℝ)

Missing 

Distinct: 2785
Distinct (%): 100.0%
Missing: 491
Missing (%): 15.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.0807945
Minimum: 0
Maximum: 14
Zeros: 1
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 25.7 KiB

Quantile statistics

Minimum: 0
5-th percentile: 4.4879707
Q1: 6.0930919
median: 7.0367521
Q3: 8.0620661
95-th percentile: 9.7898186
Maximum: 14
Range: 14
Interquartile range (IQR): 1.9689742

Descriptive statistics

Standard deviation: 1.5943195
Coefficient of variation (CV): 0.22516111
Kurtosis: 0.72031558
Mean: 7.0807945
Median Absolute Deviation (MAD): 0.984117
Skewness: 0.025630448
Sum: 19720.013
Variance: 2.5418547
Monotonicity: Not monotonic
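
Several of the figures above are derived from others, so they can be cross-checked. The sketch below recomputes the range, IQR, coefficient of variation, and variance for ph from the reported quantiles, mean, and standard deviation, using the usual textbook definitions (an assumption about how the report computes them); the variable names are illustrative.

```python
# Values copied from the ph quantile and descriptive statistics.
minimum, maximum = 0.0, 14.0
q1, q3 = 6.0930919, 8.0620661
mean, std = 7.0807945, 1.5943195

value_range = maximum - minimum   # compare with reported Range: 14
iqr = q3 - q1                     # compare with reported IQR: 1.9689742
cv = std / mean                   # compare with reported CV: 0.22516111
variance = std ** 2               # compare with reported Variance: 2.5418547

print(value_range, round(iqr, 7), round(cv, 6), round(variance, 5))
```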
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
8.55409697           1      < 0.1%
6.538084087          1      < 0.1%
5.91580675           1      < 0.1%
8.136497869          1      < 0.1%
6.493764175          1      < 0.1%
6.977405633          1      < 0.1%
5.489248055          1      < 0.1%
2.558102799          1      < 0.1%
7.312109304          1      < 0.1%
6.704431913          1      < 0.1%
Other values (2775)  2775   84.7%
(Missing)            491    15.0%

Value                Count  Frequency (%)
0                    1      < 0.1%
0.2274990502         1      < 0.1%
0.9755779898         1      < 0.1%
0.9899122129         1      < 0.1%
1.431781555          1      < 0.1%
1.757037115          1      < 0.1%
1.844538366          1      < 0.1%
1.985383359          1      < 0.1%
2.128531434          1      < 0.1%
2.376768076          1      < 0.1%

Value                Count  Frequency (%)
14                   1      < 0.1%
13.54124024          1      < 0.1%
13.34988856          1      < 0.1%
13.17540172          1      < 0.1%
12.24692807          1      < 0.1%
11.90773983          1      < 0.1%
11.89807803          1      < 0.1%
11.62114013          1      < 0.1%
11.56876797          1      < 0.1%
11.56316906          1      < 0.1%

Hardness
Real number (ℝ)

Unique 

Distinct: 3276
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 196.3695
Minimum: 47.432
Maximum: 323.124
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 25.7 KiB

Quantile statistics

Minimum: 47.432
5-th percentile: 141.76328
Q1: 176.85054
median: 196.96763
Q3: 216.66746
95-th percentile: 249.60977
Maximum: 323.124
Range: 275.692
Interquartile range (IQR): 39.816918

Descriptive statistics

Standard deviation: 32.879761
Coefficient of variation (CV): 0.16743823
Kurtosis: 0.61577168
Mean: 196.3695
Median Absolute Deviation (MAD): 19.844989
Skewness: -0.039341705
Sum: 643306.47
Variance: 1081.0787
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
204.8904555          1      < 0.1%
134.5602761          1      < 0.1%
170.1909123          1      < 0.1%
237.4610992          1      < 0.1%
171.2389255          1      < 0.1%
197.4281988          1      < 0.1%
195.7440741          1      < 0.1%
184.2318535          1      < 0.1%
187.8732835          1      < 0.1%
205.1505644          1      < 0.1%
Other values (3266)  3266   99.7%

Value                Count  Frequency (%)
47.432               1      < 0.1%
73.49223369          1      < 0.1%
77.4595861           1      < 0.1%
81.71089527          1      < 0.1%
94.09130748          1      < 0.1%
94.81254522          1      < 0.1%
94.90897713          1      < 0.1%
97.2809086           1      < 0.1%
98.3679149           1      < 0.1%
98.45293051          1      < 0.1%

Value                Count  Frequency (%)
323.124              1      < 0.1%
317.3381241          1      < 0.1%
311.3839565          1      < 0.1%
308.2538329          1      < 0.1%
307.7060241          1      < 0.1%
306.6274814          1      < 0.1%
304.2359121          1      < 0.1%
303.7026267          1      < 0.1%
300.2924758          1      < 0.1%
298.0986795          1      < 0.1%

Solids
Real number (ℝ)

Unique 

Distinct: 3276
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 22014.093
Minimum: 320.94261
Maximum: 61227.196
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 25.7 KiB

Quantile statistics

Minimum: 320.94261
5-th percentile: 9545.8126
Q1: 15666.69
median: 20927.834
Q3: 27332.762
95-th percentile: 38474.99
Maximum: 61227.196
Range: 60906.253
Interquartile range (IQR): 11666.072

Descriptive statistics

Standard deviation: 8768.5708
Coefficient of variation (CV): 0.39831625
Kurtosis: 0.44282609
Mean: 22014.093
Median Absolute Deviation (MAD): 5809.4719
Skewness: 0.62163449
Sum: 72118167
Variance: 76887834
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
20791.31898          1      < 0.1%
15979.33479          1      < 0.1%
37000.95567          1      < 0.1%
18736.1909           1      < 0.1%
12289.90092          1      < 0.1%
15979.06027          1      < 0.1%
12431.80311          1      < 0.1%
30031.83918          1      < 0.1%
29532.615            1      < 0.1%
19821.33837          1      < 0.1%
Other values (3266)  3266   99.7%

Value                Count  Frequency (%)
320.9426113          1      < 0.1%
728.7508296          1      < 0.1%
1198.943699          1      < 0.1%
1351.906979          1      < 0.1%
1372.091043          1      < 0.1%
2552.962804          1      < 0.1%
2808.025756          1      < 0.1%
2835.303165          1      < 0.1%
2912.211247          1      < 0.1%
3413.081633          1      < 0.1%

Value                Count  Frequency (%)
61227.19601          1      < 0.1%
56867.85924          1      < 0.1%
56488.67241          1      < 0.1%
56351.3963           1      < 0.1%
56320.58698          1      < 0.1%
55334.7028           1      < 0.1%
53735.89919          1      < 0.1%
52318.9173           1      < 0.1%
52060.2268           1      < 0.1%
51731.82055          1      < 0.1%

Chloramines
Real number (ℝ)

Unique 

Distinct: 3276
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.1222768
Minimum: 0.352
Maximum: 13.127
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 25.7 KiB

Quantile statistics

Minimum: 0.352
5-th percentile: 4.5030537
Q1: 6.1274208
median: 7.130299
Q3: 8.114887
95-th percentile: 9.7531005
Maximum: 13.127
Range: 12.775
Interquartile range (IQR): 1.9874663

Descriptive statistics

Standard deviation: 1.5830849
Coefficient of variation (CV): 0.22227231
Kurtosis: 0.58990117
Mean: 7.1222768
Median Absolute Deviation (MAD): 0.99166134
Skewness: -0.01209844
Sum: 23332.579
Variance: 2.5061578
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
7.300211873          1      < 0.1%
9.504361027          1      < 0.1%
6.217222542          1      < 0.1%
5.599870342          1      < 0.1%
10.78649982          1      < 0.1%
7.424944591          1      < 0.1%
6.6616162            1      < 0.1%
6.21530731           1      < 0.1%
7.981036899          1      < 0.1%
6.344963412          1      < 0.1%
Other values (3266)  3266   99.7%

Value                Count  Frequency (%)
0.352                1      < 0.1%
0.5303512947         1      < 0.1%
1.390870905          1      < 0.1%
1.683992581          1      < 0.1%
1.920271449          1      < 0.1%
2.102690991          1      < 0.1%
2.386653494          1      < 0.1%
2.39798499           1      < 0.1%
2.456013596          1      < 0.1%
2.458609195          1      < 0.1%

Value                Count  Frequency (%)
13.127               1      < 0.1%
13.04380611          1      < 0.1%
12.91218664          1      < 0.1%
12.65336202          1      < 0.1%
12.62689974          1      < 0.1%
12.58002649          1      < 0.1%
12.36328483          1      < 0.1%
12.27937418          1      < 0.1%
12.2463941           1      < 0.1%
12.22717528          1      < 0.1%

Sulfate
Real number (ℝ)

Missing 

Distinct: 2495
Distinct (%): 100.0%
Missing: 781
Missing (%): 23.8%
Infinite: 0
Infinite (%): 0.0%
Mean: 333.77578
Minimum: 129
Maximum: 481.03064
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 25.7 KiB

Quantile statistics

Minimum: 129
5-th percentile: 266.61623
Q1: 307.6995
median: 333.07355
Q3: 359.95017
95-th percentile: 403.07019
Maximum: 481.03064
Range: 352.03064
Interquartile range (IQR): 52.250673

Descriptive statistics

Standard deviation: 41.41684
Coefficient of variation (CV): 0.12408582
Kurtosis: 0.64826281
Mean: 333.77578
Median Absolute Deviation (MAD): 26.095176
Skewness: -0.035946622
Sum: 832770.56
Variance: 1715.3547
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
280.7456229          1      < 0.1%
332.7445192          1      < 0.1%
391.9182286          1      < 0.1%
330.9053704          1      < 0.1%
402.3134271          1      < 0.1%
360.6978151          1      < 0.1%
336.0404518          1      < 0.1%
405.5273372          1      < 0.1%
346.0636768          1      < 0.1%
368.5164413          1      < 0.1%
Other values (2485)  2485   75.9%
(Missing)            781    23.8%

Value                Count  Frequency (%)
129                  1      < 0.1%
180.2067464          1      < 0.1%
182.3973702          1      < 0.1%
187.1707144          1      < 0.1%
187.4241309          1      < 0.1%
192.0335917          1      < 0.1%
203.4445208          1      < 0.1%
205.9350906          1      < 0.1%
206.2472294          1      < 0.1%
207.8904823          1      < 0.1%

Value                Count  Frequency (%)
481.0306423          1      < 0.1%
476.5397173          1      < 0.1%
475.7374602          1      < 0.1%
462.474215           1      < 0.1%
460.107069           1      < 0.1%
458.4410723          1      < 0.1%
455.4512337          1      < 0.1%
450.9144544          1      < 0.1%
449.2676875          1      < 0.1%
447.4179624          1      < 0.1%

Conductivity
Real number (ℝ)

Unique 

Distinct: 3276
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 426.20511
Minimum: 181.48375
Maximum: 753.34262
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 25.7 KiB

Quantile statistics

Minimum: 181.48375
5-th percentile: 300.10947
Q1: 365.73441
median: 421.88497
Q3: 481.7923
95-th percentile: 566.34932
Maximum: 753.34262
Range: 571.85887
Interquartile range (IQR): 116.05789

Descriptive statistics

Standard deviation: 80.824064
Coefficient of variation (CV): 0.18963654
Kurtosis: -0.27709283
Mean: 426.20511
Median Absolute Deviation (MAD): 57.887591
Skewness: 0.26449022
Sum: 1396247.9
Variance: 6532.5293
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
564.3086542          1      < 0.1%
418.6420628          1      < 0.1%
517.5767619          1      < 0.1%
235.0422835          1      < 0.1%
501.5597252          1      < 0.1%
452.1872326          1      < 0.1%
367.8540248          1      < 0.1%
400.6118991          1      < 0.1%
469.1321169          1      < 0.1%
482.5957093          1      < 0.1%
Other values (3266)  3266   99.7%

Value                Count  Frequency (%)
181.483754           1      < 0.1%
201.6197368          1      < 0.1%
210.319182           1      < 0.1%
217.3583296          1      < 0.1%
232.613624           1      < 0.1%
233.9079651          1      < 0.1%
235.0422835          1      < 0.1%
245.859632           1      < 0.1%
247.9180305          1      < 0.1%
251.0208987          1      < 0.1%

Value                Count  Frequency (%)
753.3426196          1      < 0.1%
708.2263645          1      < 0.1%
695.369528           1      < 0.1%
674.4434759          1      < 0.1%
672.5569992          1      < 0.1%
669.7250862          1      < 0.1%
666.6906183          1      < 0.1%
660.2549463          1      < 0.1%
657.5704218          1      < 0.1%
656.9241278          1      < 0.1%

Organic_carbon
Real number (ℝ)

Unique 

Distinct: 3276
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14.28497
Minimum: 2.2
Maximum: 28.3
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 25.7 KiB

Quantile statistics

Minimum: 2.2
5-th percentile: 8.8153617
Q1: 12.065801
median: 14.218338
Q3: 16.557652
95-th percentile: 19.637254
Maximum: 28.3
Range: 26.1
Interquartile range (IQR): 4.4918502

Descriptive statistics

Standard deviation: 3.308162
Coefficient of variation (CV): 0.2315834
Kurtosis: 0.044409307
Mean: 14.28497
Median Absolute Deviation (MAD): 2.2322941
Skewness: 0.025532582
Sum: 46797.563
Variance: 10.943936
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
10.37978308          1      < 0.1%
12.89763545          1      < 0.1%
15.87176979          1      < 0.1%
11.545477            1      < 0.1%
12.28433352          1      < 0.1%
18.58495937          1      < 0.1%
21.30064694          1      < 0.1%
15.28878163          1      < 0.1%
16.1692117           1      < 0.1%
12.16473568          1      < 0.1%
Other values (3266)  3266   99.7%

Value                Count  Frequency (%)
2.2                  1      < 0.1%
4.371898608          1      < 0.1%
4.466771969          1      < 0.1%
4.473092264          1      < 0.1%
4.861631498          1      < 0.1%
4.902888068          1      < 0.1%
4.966861619          1      < 0.1%
5.051694615          1      < 0.1%
5.159380308          1      < 0.1%
5.188466455          1      < 0.1%

Value                Count  Frequency (%)
28.3                 1      < 0.1%
27.00670661          1      < 0.1%
24.75539237          1      < 0.1%
23.95245044          1      < 0.1%
23.91760126          1      < 0.1%
23.66766678          1      < 0.1%
23.60429797          1      < 0.1%
23.56964491          1      < 0.1%
23.51477377          1      < 0.1%
23.39951606          1      < 0.1%

Trihalomethanes
Real number (ℝ)

Missing 

Distinct: 3114
Distinct (%): 100.0%
Missing: 162
Missing (%): 4.9%
Infinite: 0
Infinite (%): 0.0%
Mean: 66.396293
Minimum: 0.738
Maximum: 124
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 25.7 KiB

Quantile statistics

Minimum: 0.738
5-th percentile: 39.552928
Q1: 55.844536
median: 66.622485
Q3: 77.337473
95-th percentile: 92.124059
Maximum: 124
Range: 123.262
Interquartile range (IQR): 21.492937

Descriptive statistics

Standard deviation: 16.175008
Coefficient of variation (CV): 0.24361313
Kurtosis: 0.23859744
Mean: 66.396293
Median Absolute Deviation (MAD): 10.742172
Skewness: -0.083030674
Sum: 206758.06
Variance: 261.6309
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
86.99097046          1      < 0.1%
56.71550955          1      < 0.1%
77.73081437          1      < 0.1%
90.39489472          1      < 0.1%
37.78709664          1      < 0.1%
78.9255271           1      < 0.1%
89.47771837          1      < 0.1%
69.526718            1      < 0.1%
72.57395938          1      < 0.1%
57.78086932          1      < 0.1%
Other values (3104)  3104   94.7%
(Missing)            162    4.9%

Value                Count  Frequency (%)
0.738                1      < 0.1%
8.175876384          1      < 0.1%
8.577012933          1      < 0.1%
14.34316145          1      < 0.1%
15.6848768           1      < 0.1%
16.2915046           1      < 0.1%
17.00068293          1      < 0.1%
17.52776496          1      < 0.1%
17.91572257          1      < 0.1%
18.01527236          1      < 0.1%

Value                Count  Frequency (%)
124                  1      < 0.1%
120.030077           1      < 0.1%
118.3572747          1      < 0.1%
116.1616216          1      < 0.1%
114.2086714          1      < 0.1%
114.0349457          1      < 0.1%
113.0488857          1      < 0.1%
112.622733           1      < 0.1%
112.4122104          1      < 0.1%
112.0610274          1      < 0.1%

Turbidity
Real number (ℝ)

Unique 

Distinct: 3276
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.9667862
Minimum: 1.45
Maximum: 6.739
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 25.7 KiB

Quantile statistics

Minimum: 1.45
5-th percentile: 2.6842792
Q1: 3.4397109
median: 3.9550276
Q3: 4.5003198
95-th percentile: 5.2209245
Maximum: 6.739
Range: 5.289
Interquartile range (IQR): 1.0606089

Descriptive statistics

Standard deviation: 0.78038241
Coefficient of variation (CV): 0.19672913
Kurtosis: -0.062800641
Mean: 3.9667862
Median Absolute Deviation (MAD): 0.53029624
Skewness: -0.0078166424
Sum: 12995.191
Variance: 0.6089967
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
2.963135381          1      < 0.1%
3.987012091          1      < 0.1%
4.066229364          1      < 0.1%
3.759326201          1      < 0.1%
4.876273             1      < 0.1%
5.143750122          1      < 0.1%
4.513200539          1      < 0.1%
4.20418585           1      < 0.1%
4.586748359          1      < 0.1%
4.910911021          1      < 0.1%
Other values (3266)  3266   99.7%

Value                Count  Frequency (%)
1.45                 1      < 0.1%
1.492206615          1      < 0.1%
1.496100943          1      < 0.1%
1.64151501           1      < 0.1%
1.659799385          1      < 0.1%
1.680554025          1      < 0.1%
1.687624505          1      < 0.1%
1.801326999          1      < 0.1%
1.81252894           1      < 0.1%
1.844371604          1      < 0.1%

Value                Count  Frequency (%)
6.739                1      < 0.1%
6.494748556          1      < 0.1%
6.494249467          1      < 0.1%
6.389161009          1      < 0.1%
6.35743852           1      < 0.1%
6.307678472          1      < 0.1%
6.226580405          1      < 0.1%
6.204846359          1      < 0.1%
6.099631873          1      < 0.1%
6.083772354          1      < 0.1%

Potability
Categorical

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 25.7 KiB
0: 1998
1: 1278

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 3276
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value                Count  Frequency (%)
0                    1998   61.0%
1                    1278   39.0%
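
The class shares follow directly from the two counts. A minimal sketch of that calculation is given here; the imbalance ratio is an extra illustrative quantity that the report itself does not list, and the variable names are assumptions.

```python
# Counts copied from the Potability common-values table.
counts = {"0": 1998, "1": 1278}

total = sum(counts.values())
shares = {label: 100 * n / total for label, n in counts.items()}
# Illustrative extra: ratio of the larger class to the smaller one.
imbalance_ratio = max(counts.values()) / min(counts.values())

for label, share in shares.items():
    print(f"{label}: {share:.1f}%")
print(f"majority/minority ratio: {imbalance_ratio:.2f}")
```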

Length

Histogram of lengths of the category

Common Values (Plot)

Value                Count  Frequency (%)
0                    1998   61.0%
1                    1278   39.0%

Most occurring characters

Value                Count  Frequency (%)
0                    1998   61.0%
1                    1278   39.0%

Most occurring categories

Value                Count  Frequency (%)
(unknown)            3276   100.0%

Most frequent character per category

(unknown)
Value                Count  Frequency (%)
0                    1998   61.0%
1                    1278   39.0%

Most occurring scripts

Value                Count  Frequency (%)
(unknown)            3276   100.0%

Most frequent character per script

(unknown)
Value                Count  Frequency (%)
0                    1998   61.0%
1                    1278   39.0%

Most occurring blocks

Value                Count  Frequency (%)
(unknown)            3276   100.0%

Most frequent character per block

(unknown)
Value                Count  Frequency (%)
0                    1998   61.0%
1                    1278   39.0%

Interactions

[Figure: 81 pairwise interaction plots (9 x 9 numeric variables); images not reproduced in this text.]

Correlations

Variable         Chloramines  Conductivity  Hardness  Organic_carbon  Potability  Solids  Sulfate  Trihalomethanes  Turbidity  ph
Chloramines      1.000        -0.017        -0.025    -0.012          0.077       -0.055  0.037    0.018            -0.008     -0.042
Conductivity     -0.017       1.000         -0.033    0.021           0.000       0.021   -0.022   -0.004           0.010      0.017
Hardness         -0.025       -0.033        1.000     0.003           0.079       -0.053  -0.095   -0.012           -0.013     0.116
Organic_carbon   -0.012       0.021         0.003     1.000           0.015       0.018   0.020    -0.008           -0.025     0.044
Potability       0.077        0.000         0.079     0.015           1.000       0.025   0.151    0.000            0.000      0.084
Solids           -0.055       0.021         -0.053    0.018           0.025       1.000   -0.154   -0.020           0.028      -0.075
Sulfate          0.037        -0.022        -0.095    0.020           0.151       -0.154  1.000    -0.031           -0.019     0.024
Trihalomethanes  0.018        -0.004        -0.012    -0.008          0.000       -0.020  -0.031   1.000            -0.028     0.005
Turbidity        -0.008       0.010         -0.013    -0.025          0.000       0.028   -0.019   -0.028           1.000      -0.049
ph               -0.042       0.017         0.116     0.044           0.084       -0.075  0.024    0.005            -0.049     1.000
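
To see which pairs stand out in a matrix this size, the off-diagonal entries can be ranked by absolute value. The sketch below does this on the coefficients exactly as printed in the report (the correlation method is not stated there, so none is assumed); `names`, `matrix`, and `top3` are illustrative names.

```python
names = ["Chloramines", "Conductivity", "Hardness", "Organic_carbon",
         "Potability", "Solids", "Sulfate", "Trihalomethanes",
         "Turbidity", "ph"]

# Rows and columns follow the order of `names`; values copied from the report.
matrix = [
    [1.000, -0.017, -0.025, -0.012, 0.077, -0.055, 0.037, 0.018, -0.008, -0.042],
    [-0.017, 1.000, -0.033, 0.021, 0.000, 0.021, -0.022, -0.004, 0.010, 0.017],
    [-0.025, -0.033, 1.000, 0.003, 0.079, -0.053, -0.095, -0.012, -0.013, 0.116],
    [-0.012, 0.021, 0.003, 1.000, 0.015, 0.018, 0.020, -0.008, -0.025, 0.044],
    [0.077, 0.000, 0.079, 0.015, 1.000, 0.025, 0.151, 0.000, 0.000, 0.084],
    [-0.055, 0.021, -0.053, 0.018, 0.025, 1.000, -0.154, -0.020, 0.028, -0.075],
    [0.037, -0.022, -0.095, 0.020, 0.151, -0.154, 1.000, -0.031, -0.019, 0.024],
    [0.018, -0.004, -0.012, -0.008, 0.000, -0.020, -0.031, 1.000, -0.028, 0.005],
    [-0.008, 0.010, -0.013, -0.025, 0.000, 0.028, -0.019, -0.028, 1.000, -0.049],
    [-0.042, 0.017, 0.116, 0.044, 0.084, -0.075, 0.024, 0.005, -0.049, 1.000],
]

# Collect each unordered pair once (upper triangle, diagonal excluded).
pairs = [
    (names[i], names[j], matrix[i][j])
    for i in range(len(names))
    for j in range(i + 1, len(names))
]
pairs.sort(key=lambda p: abs(p[2]), reverse=True)

top3 = pairs[:3]
for a, b, r in top3:
    print(f"{a} / {b}: {r:+.3f}")
```

Even the largest entries are small in magnitude, so no pair of variables in this matrix is strongly related.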

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

First 10 rows

Row   ph         Hardness    Solids        Chloramines  Sulfate     Conductivity  Organic_carbon  Trihalomethanes  Turbidity  Potability
0     NaN        204.890455  20791.318981  7.300212     368.516441  564.308654    10.379783       86.990970        2.963135   0
1     3.716080   129.422921  18630.057858  6.635246     NaN         592.885359    15.180013       56.329076        4.500656   0
2     8.099124   224.236259  19909.541732  9.275884     NaN         418.606213    16.868637       66.420093        3.055934   0
3     8.316766   214.373394  22018.417441  8.059332     356.886136  363.266516    18.436524       100.341674       4.628771   0
4     9.092223   181.101509  17978.986339  6.546600     310.135738  398.410813    11.558279       31.997993        4.075075   0
5     5.584087   188.313324  28748.687739  7.544869     326.678363  280.467916    8.399735        54.917862        2.559708   0
6     10.223862  248.071735  28749.716544  7.513408     393.663396  283.651634    13.789695       84.603556        2.672989   0
7     8.635849   203.361523  13672.091764  4.563009     303.309771  474.607645    12.363817       62.798309        4.401425   0
8     NaN        118.988579  14285.583854  7.804174     268.646941  389.375566    12.706049       53.928846        3.595017   0
9     11.180284  227.231469  25484.508491  9.077200     404.041635  563.885481    17.927806       71.976601        4.370562   0

Last 10 rows

Row   ph         Hardness    Solids        Chloramines  Sulfate     Conductivity  Organic_carbon  Trihalomethanes  Turbidity  Potability
3266  8.372910   169.087052  14622.745494  7.547984     NaN         464.525552    11.083027       38.435151        4.906358   1
3267  8.989900   215.047358  15921.412018  6.297312     312.931022  390.410231    9.899115        55.069304        4.613843   1
3268  6.702547   207.321086  17246.920347  7.708117     304.510230  329.266002    16.217303       28.878601        3.442983   1
3269  11.491011  94.812545   37188.826022  9.263166     258.930600  439.893618    16.172755       41.558501        4.369264   1
3270  6.069616   186.659040  26138.780191  7.747547     345.700257  415.886955    12.067620       60.419921        3.669712   1
3271  4.668102   193.681735  47580.991603  7.166639     359.948574  526.424171    13.894419       66.687695        4.435821   1
3272  7.808856   193.553212  17329.802160  8.061362     NaN         392.449580    19.903225       NaN              2.798243   1
3273  9.419510   175.762646  33155.578218  7.350233     NaN         432.044783    11.039070       69.845400        3.298875   1
3274  5.126763   230.603758  11983.869376  6.303357     NaN         402.883113    11.168946       77.488213        4.708658   1
3275  7.874671   195.102299  17404.177061  7.509306     NaN         327.459760    16.140368       78.698446        2.309149   1
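
The NaN entries in the sample rows are the missing cells counted in the alerts. As a small illustration, the sketch below counts them for two columns of the first ten rows using only the standard library; the dictionary layout and names are assumptions made for the example, and only the ph and Sulfate values are copied.

```python
import math

nan = float("nan")

# ph and Sulfate for rows 0-9, copied from the first rows of the sample.
sample = {
    "ph": [nan, 3.716080, 8.099124, 8.316766, 9.092223,
           5.584087, 10.223862, 8.635849, nan, 11.180284],
    "Sulfate": [368.516441, nan, nan, 356.886136, 310.135738,
                326.678363, 393.663396, 303.309771, 268.646941, 404.041635],
}

# Missing values per column.
missing = {col: sum(math.isnan(v) for v in vals) for col, vals in sample.items()}

# Rows in which neither of the two columns is missing.
n_rows = len(sample["ph"])
complete_rows = sum(
    not any(math.isnan(vals[i]) for vals in sample.values())
    for i in range(n_rows)
)

print(missing, complete_rows)
```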